Abstract

The problem of separating the jump part of a multidimensional regular affine process from 
its continuous part is considered. In particular, we present an algorithm for a nonparametric 
estimation of the jump distribution under the presence of a nonzero diffusion component. 
An estimation methodology is proposed which is based on the log-affine representation of
the conditional characteristic function of a regular affine process and employs a smoothed-in-time
version of the empirical characteristic function in order to estimate the derivatives of
the conditional characteristic function. We derive almost sure uniform rates of convergence 
for the estimated Levy density and prove that these rates are optimal in the minimax sense. 
Finally, the performance of the estimation algorithm is illustrated in the case of the Bates 
stochastic volatility model. 

Keywords: Regular affine processes, estimation, empirical characteristic function 

1 Introduction 

The problem of nonparametric statistical inference for jump processes, or more generally for
semimartingale models, has a long history and goes back to the works of Rubin and Tucker (1959) and
Basawa and Brockwell (1982). The recent revival of interest in this topic, documented for example
in Figueroa-Lopez (2004) and Figueroa-Lopez (2009), is mainly related to the wide availability
of financial and economic time series data and to new types of statistical issues that have not been
addressed before. For instance, there is now considerable evidence (see, e.g., Cont and Mancini
(2007)) that most financial time series contain a continuous martingale component. This is why
a number of works that appeared in recent years deal with the problem of estimating some
characteristics of jumps in general semimartingale models with a nonzero continuous part. Without
further assumptions this kind of statistical inference is not possible, because the behavior of
the jump component becomes statistically indistinguishable from the behavior of the diffusion
part as the activity of small jumps tends to infinity. In the case of Levy processes the
activity of small jumps can be measured by the so-called Blumenthal-Getoor index. The nearer
the Blumenthal-Getoor index is to 2, the more difficult it becomes to separate the
jump component from the diffusion component, and hence to make statistical inference
on the characteristics of jumps (see, e.g., Neumann and Reiß (2007)). Suppose that values of
a log-return process $X(t) = \log(S(t))$ on a time grid $\pi = \{t_0, t_1, \ldots, t_N\}$ are observed. If $|\pi|$
is small (high-frequency data), then a large increment $X(t_i) - X(t_{i-1})$ indicates that a jump
occurred between times $t_{i-1}$ and $t_i$. Based on this insight and its continuous-time observation
analogue, inference for various characteristics of jumps of the underlying semimartingale can be
conducted. For example, in Aït-Sahalia and Jacod (2008) the problem of statistical inference on
the degree of jump activity in general semimartingale models based on high-frequency data
is considered. They proposed an estimation procedure which is able to "see through" the
continuous part of the semimartingale and consistently estimate the degree of small jump activity under
some restrictions on the structure of the underlying semimartingale. In fact, these restrictions
keep the highest degree of activity of small jumps away from 2, thus allowing for a consistent
estimation of the degree of jump activity.

In this work we consider the problem of estimating the characteristics of jumps in the class 
of regular affine models with a nonzero continuous part. Affine Ito-Levy processes are nowadays 
rather popular in financial and econometric modeling. Due to their analytical tractability on the 
one hand and their rather rich dynamics and implied volatility patterns on the other hand, they 
are particularly useful in the context of option pricing. Many well known models such as Heston 
and Bates stochastic volatility models fall into the class of affine Ito-Levy processes. Option 
pricing in these models can be conveniently done via the Fourier method. The literature on affine
processes is rather extensive. Let us mention two seminal papers, Duffie, Pan and Singleton
(2000) and Duffie, Filipovic and Schachermayer (2003), where a theoretical analysis of regular
affine models has been conducted. More recent literature includes Glasserman and Kim (2007)
and Keller-Ressel and Steiner (2008).

We propose an approach based on the log-affine representation of the conditional
characteristic function of a regular affine process. This representation, together with a suitable
transformation, allows one to consistently estimate the characteristics of the jump component under
some prior bound on the highest degree of activity of small jumps. We present uniform convergence
rates for the resulting estimate of a transformed Levy density, which turn out to be optimal
in the minimax sense. As a main technical result that may be of independent interest, we provide
uniform convergence rates for a smoothed-in-time version of the empirical characteristic function.

The problem of parametric estimation of the characteristics of an affine jump-diffusion
process (a process with finite intensity of jumps) $X(t)$ from a high-frequency time series of the
asset $S(t) = \exp(X(t))$ has recently been considered in the literature by Singleton (2000)
and Bates (2005). In Singleton (2000) the generalized method of moments (GMM) based on the
empirical characteristic function is employed and asymptotic properties of the estimator are
investigated. Bates (2005) proposed a filtration-based maximum likelihood methodology for
estimating the parameters and realizations of latent affine processes. Filtration is conducted 
in the transform space of characteristic functions, using a version of Bayes rule for recursively 
updating the joint characteristic function of latent variables and the data conditional upon past 
data. Since the characteristics of an affine process are a priori an infinite-dimensional object, a 
parametric approach is always exposed to the problem of misspecification, in particular when 
there is no inherent economic foundation of the parameters and they are only used to generate 
different shapes of possible jump distributions. The problem of semi-parametric inference for
the characteristics of some special types of affine processes $X(t)$ has been studied in the literature
as well. In the case of high-frequency observations, the problem of nonparametric inference on
the Levy measure for time-changed Levy processes, which sometimes belong to the class of affine
processes, has recently been studied in Figueroa-Lopez (2009), where consistency was proved. In
another work an Ornstein-Uhlenbeck process, an affine process with zero diffusion part, is considered. The authors
assume that the corresponding jump component is self-decomposable, i.e. the degree of the
jump activity is less than 1. They propose a cumulant M-estimator to estimate the so-called
canonical function of the driving self-decomposable process from low-frequency data and prove
consistency of the resulting estimate. As to the special case of Levy processes, a semi-parametric
estimation problem for Levy models under low-frequency data has recently been studied in
Neumann and Reiß (2007). Let us mention that in Neumann and Reiß (2007) the diffusion
component is assumed to be known. So the above works do not encounter the problem of separating
diffusion and jump components as the activity of small jumps increases. Furthermore, the
challenge of devising nonparametric estimation methods for the Levy density in general regular affine
models lies in the fact that the conditional characteristic function of a regular
affine process does not have as explicit a form as in the case of Levy processes, and is related to the
parameters of the underlying affine process not directly but via a Riccati equation. Last
but not least, the increments of an affine process are no longer independent, hence
advanced tools from time series analysis have to be used.

The paper is organized as follows. In Section 2 we recall the definition and basic properties
of regular affine processes. In Section 3 we formulate a spectral estimation algorithm which
applies as soon as estimates of the corresponding conditional characteristic function and its
time derivative are available; its theoretical properties, including uniform rates of convergence
and their optimality in the minimax sense, are studied in Section 4. Section 5 is devoted to
nonparametric estimation of the above characteristic function and its derivatives. A numerical
example is given in Section 6, and the proofs are collected in Section 7.



2 Affine processes 



Let us fix a probability space $(\Omega, \mathcal{F}, P)$ and an information filtration $(\mathcal{F}_t)_{t \ge 0}$. The process $X(t)$
is an affine process if it is a stochastically continuous, time-homogeneous Markov process with state
space $\mathcal{D} \subset \mathbb{R}^d$ such that the logarithm of the characteristic function of $X(s)$ given $X(0)$ is an
affine function of the initial state $X(0)$:

(1)  $$\phi(u|s,x) := E\left( e^{\mathrm{i} u^\top X(s)} \,\middle|\, X(0) = x \right) = e^{\psi_0(u,s) + x^\top \psi_1(u,s)}, \quad u \in \mathbb{R}^d.$$

The affine process $X(t)$ is called regular if the derivatives

$$F_0(u) := \left. \frac{\partial \psi_0(u,s)}{\partial s} \right|_{s=0}, \qquad F_1(u) := \left. \frac{\partial \psi_1(u,s)}{\partial s} \right|_{s=0}$$

exist and are continuous at $u = 0$. The following theorem provides the characterization of affine
processes and is proved in Duffie, Filipovic and Schachermayer (2003).

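Theorem 2.1. If $(X(t))_{t \ge 0}$ is a regular affine process, then $\psi_0$ and $\psi_1$ satisfy the generalized
Riccati equations

(2)  $$\frac{\partial \psi_0(u,s)}{\partial s} = F_0(\psi_1(u,s)), \qquad \psi_0(u,0) = 0,$$

(3)  $$\frac{\partial \psi_1(u,s)}{\partial s} = F_1(\psi_1(u,s)), \qquad \psi_1(u,0) = \mathrm{i}u,$$

where

$$F_0(z) = (\alpha^{(0)} z, z) + (z, \beta^{(0)}) - \gamma^{(0)} + \int_{\mathcal{D} \setminus \{0\}} \left( e^{z^\top u} - 1 - (\chi(u), z) \right) \nu^{(0)}(du),$$

$$F_{1,j}(z) = (\alpha^{(1)}_j z, z) + (z, \beta^{(1)}_j) - \gamma^{(1)}_j + \int_{\mathcal{D} \setminus \{0\}} \left( e^{z^\top u} - 1 - (\chi(u), z) \right) \nu^{(1)}_j(du)$$

for $j = 1, \ldots, d$ and $\chi(u) = (\chi_1(u), \ldots, \chi_d(u))$ with

$$\chi_k(u) = \begin{cases} 0, & u_k = 0, \\ (1 \wedge |u_k|) \, \dfrac{u_k}{|u_k|}, & \text{otherwise} \end{cases}$$

for $k = 1, \ldots, d$. Here $\alpha = (\alpha^{(0)}, \alpha^{(1)}) \in \mathbb{R}^{d \times d} \times \mathbb{R}^{d \times d \times d}$, $\beta = (\beta^{(0)}, \beta^{(1)}) \in \mathbb{R}^d \times \mathbb{R}^{d \times d}$,
$\gamma = (\gamma^{(0)}, \gamma^{(1)}) \in \mathbb{R} \times \mathbb{R}^d$ and $\nu = (\nu^{(0)}, \nu^{(1)}_1, \ldots, \nu^{(1)}_d)$ is a vector of measures on $\mathbb{R}^d$ satisfying

$$\int_{\mathcal{D} \setminus \{0\}} \|\chi(u)\|_2^2 \, \nu^{(0)}(du) < \infty, \qquad \int_{\mathcal{D} \setminus \{0\}} \|\chi(u)\|_2^2 \, \nu^{(1)}_j(du) < \infty, \quad j = 1, \ldots, d,$$

where here and in the sequel $\|x\|_2 := \sqrt{x_1^2 + \ldots + x_d^2}$ for any $x \in \mathbb{R}^d$.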
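As an illustration of how the Riccati system (2)-(3) determines the conditional characteristic function, the following is a minimal sketch, not taken from the paper, that integrates the system with a classical Runge-Kutta scheme. The example process is an assumption chosen for simplicity: a one-dimensional Ornstein-Uhlenbeck process $dX = -\lambda X\,dt + \sigma\,dW$ without jumps, for which $F_0(z) = \tfrac{1}{2}\sigma^2 z^2$ and $F_1(z) = -\lambda z$, and for which the closed-form solution $\psi_1(u,s) = \mathrm{i}u e^{-\lambda s}$, $\psi_0(u,s) = -\sigma^2 u^2 (1 - e^{-2\lambda s})/(4\lambda)$ follows from the Gaussian transition law. The function name `solve_riccati` is hypothetical.

```python
import numpy as np

def solve_riccati(u, s_max, F0, F1, n_steps=2000):
    """Integrate d(psi0)/ds = F0(psi1), d(psi1)/ds = F1(psi1) with RK4.

    Scalar (d = 1) sketch; F0 and F1 are user-supplied callables on complex z.
    """
    h = s_max / n_steps
    psi0, psi1 = 0.0 + 0.0j, 1j * u          # initial conditions in (2)-(3)
    for _ in range(n_steps):
        # RK4 stages for psi1
        k1 = F1(psi1)
        k2 = F1(psi1 + 0.5 * h * k1)
        k3 = F1(psi1 + 0.5 * h * k2)
        k4 = F1(psi1 + h * k3)
        # psi0 is driven by F0 evaluated at the same stage values of psi1
        g1 = F0(psi1)
        g2 = F0(psi1 + 0.5 * h * k1)
        g3 = F0(psi1 + 0.5 * h * k2)
        g4 = F0(psi1 + h * k3)
        psi0 += h * (g1 + 2.0 * g2 + 2.0 * g3 + g4) / 6.0
        psi1 += h * (k1 + 2.0 * k2 + 2.0 * k3 + k4) / 6.0
    return psi0, psi1

# Assumed example: OU process with mean reversion lam and volatility sigma.
lam, sigma = 5.0, 3.5
psi0, psi1 = solve_riccati(1.5, 0.7,
                           F0=lambda z: 0.5 * sigma**2 * z**2,
                           F1=lambda z: -lam * z)
# Conditional c.f. at x = 0.2 according to representation (1)
phi = np.exp(psi0 + 0.2 * psi1)
```

For models with jumps only `F0` changes (it gains the integral term against $\nu^{(0)}$), which is why the Levy measure enters the characteristic function only indirectly, through the solution of the system.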

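Under some admissibility conditions a regular affine process $X(t)$ is a Feller process in the
domain $\mathcal{D} = \mathbb{R}^m_+ \times \mathbb{R}^{d-m}$ (see Duffie, Filipovic and Schachermayer (2003), Section 2) with the
infinitesimal generator

$$\mathcal{A} f(x) = \sum_{k,l=m+1}^{d} \left( \alpha^{(0)}_{kl} + \sum_{i=1}^{m} \alpha^{(1)}_{i,kl} x_i \right) \frac{\partial^2 f(x)}{\partial x_k \partial x_l} + \sum_{i=1}^{m} \alpha^{(1)}_{i,ii} x_i \frac{\partial^2 f(x)}{\partial x_i^2}$$

$$\quad + \sum_{k=1}^{d} \beta^{(0)}_k \frac{\partial f(x)}{\partial x_k} + \sum_{k=1}^{d} \sum_{i=1}^{m} \beta^{(1)}_{i,k} x_i \frac{\partial f(x)}{\partial x_k} + \sum_{k=m+1}^{d} \sum_{i=m+1}^{d} \beta^{(1)}_{i,k} x_i \frac{\partial f(x)}{\partial x_k}$$

$$\quad + \int \left[ f(x + \xi) - f(x) - \sum_{k=m+1}^{d} \chi_k(\xi) \nabla_k f(x) \right] \nu^{(0)}(d\xi)$$

$$\quad + \sum_{i=1}^{m} x_i \int \left[ f(x + \xi) - f(x) - \chi_i(\xi) \nabla_i f(x) - \sum_{k=m+1}^{d} \chi_k(\xi) \nabla_k f(x) \right] \nu^{(1)}_i(d\xi),$$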

where $0 \le m \le d$ and $f \in C^2(\mathcal{D})$. In the sequel we assume that the above admissibility conditions
hold. Moreover, we restrict our analysis to the class of regular affine processes with

(4)  $$\nu^{(1)}_1 \equiv \ldots \equiv \nu^{(1)}_d \equiv 0.$$

On the one hand, this assumption reduces the dimensionality of the jump component of $X$. On
the other hand, the class of affine models satisfying (4) remains rather large and includes such
well-known models as the Heston, Bates and Barndorff-Nielsen and Shephard stochastic volatility
models.

The goal of this paper is to investigate the problem of nonparametric inference for the
Levy measure $\nu^{(0)}$ based on a time series of the asset prices $S(t) = (S_1(t), \ldots, S_d(t))$ that follow
an exponential regular affine model:

(5)  $$S_k(t) = S_k(0) e^{X_k(t)}, \quad t \in [0,T], \quad k = 1, \ldots, d,$$

where $X(t) = (X_1(t), \ldots, X_d(t))$ is a regular affine process with characteristics $\chi = (\alpha, \beta, \gamma, \nu)$.
2.1 Regularity properties of affine processes

In the sequel we shall need to know how fast the derivatives of the conditional characteristic
function of an affine process can grow. The following lemma provides the corresponding bounds.

Lemma 2.2. Suppose that for some natural $k > 0$ the Levy measure $\nu^{(0)}$ satisfies

(6)  $$\int_{\{\|x\| > 1\}} \|x\|^k \, \nu^{(0)}(dx) < \infty.$$

Then the functions $\psi_0(u,s)$ and $\psi_1(u,s)$ from the representation (1) are in $C^{k+1}(\mathbb{R}_+)$ as functions
of $s$. Moreover, for any fixed $x \in \mathcal{D}$ and any $u \in \mathbb{R}^d$ the following estimates hold:

(7)  $$\sup_{s \in [0,T]} \left| \frac{\partial^l \phi(u|s,x)}{\partial s^l} \right| \le C \|u\|_2^{2l}, \quad l = 0, \ldots, k+1,$$

where $C$ is a positive constant depending on $T$ and $x$.

3 Spectral estimation 

If values of the log-return process $X(t)$ on a time grid $\pi = \{t_0, t_1, \ldots, t_N\}$ are observed and
$|\pi|$ is small (high-frequency data), then a large increment $X(t_j) - X(t_{j-1})$ indicates that a jump
occurred between times $t_{j-1}$ and $t_j$. Considering different functions of increments and
averaging over all increments, we can estimate various integrals w.r.t. the underlying Levy measure
and hence various characteristics of the jump component. Another possibility to estimate the
characteristics of jumps is to make use of the spectral representation (1) by first estimating
the c.f. $\phi(u|s,x)$ and then its derivative $\partial_s \log(\phi(u|s,x))$ at the point $s = 0$. Finally, the
representation of Theorem 2.1 and a suitable transformation allow one to eliminate the diffusion part
of $X$ and to estimate the Levy measure $\nu^{(0)}$. Suppose that for any fixed $x \in \mathcal{D}$ two sequences
of estimates $\{\hat\phi_N(u|s,x)\}$ and $\{\hat\phi_{s,N}(u|s,x)\}$ of the conditional characteristic function $\phi(u|s,x)$
and its derivative $\partial_s \phi(u|s,x)$ respectively, where $u \in \mathbb{R}^d$ and $s \in \mathcal{S} := [0,T]$ for some $T > 0$, are
constructed. At this stage it is not important how they were obtained. We assume only that
the estimates $\hat\phi_N(u|s,x)$ and $\hat\phi_{s,N}(u|s,x)$ are uniformly consistent in $u$, i.e. for any fixed $x \in \mathcal{D}$
and $s \in \mathcal{S}$

(8)  $$\sup_{u \in \mathbb{R}^d} w(\|u\|) \, |\hat\phi_N(u|s,x) - \phi(u|s,x)| = O_{a.s.}(\zeta_{0,N}),$$

(9)  $$\sup_{u \in \mathbb{R}^d} w(\|u\|) \, |\hat\phi_{s,N}(u|s,x) - \partial_s \phi(u|s,x)| = O_{a.s.}(\zeta_{1,N}),$$

where $w$ is a non-negative weighting function on $\mathbb{R}_+$, $\|u\| = \max_{i=1,\ldots,d} |u_i|$ for any $u \in \mathbb{R}^d$ and
$\zeta_N := \max\{\zeta_{0,N}, \zeta_{1,N}\} \to 0$ as $N \to \infty$. In the next sections we discuss how to construct such
consistent estimates from time series data under random sampling, using a smoothed-in-time
version of the empirical characteristic function.

3.1 Algorithm 

Our aim is to consistently estimate the Levy measure $\nu^{(0)}$ using the estimates $\hat\phi_N(u|s,x)$ and
$\hat\phi_{s,N}(u|s,x)$. Note that in this formulation the other parameters of the affine process $X(t)$ are
viewed as nuisance components which are to be filtered out during estimation. The main idea
of the estimation algorithm is as follows. Denote $\psi(u|s,x) := \psi_0(u,s) + x^\top \psi_1(u,s)$ and introduce
the function

(10)  $$\Psi(u) := \int_{[-1,1]^d} \left( \partial_s \psi(u|0,x) - \partial_s \psi(u + w|0,x) \right) dw$$

that fulfills (due to (2) and (3))

$$\Psi(u) = \frac{2^d}{3} \operatorname{tr}(\alpha^{(0)}) + \frac{2^d}{3} \sum_{j=1}^{d} x_j \operatorname{tr}(\alpha^{(1)}_j) + \int_{\mathbb{R}^d} e^{\mathrm{i} u^\top z} \rho^{(0)}(dz),$$

where

(11)  $$\rho^{(0)}(dz) := 2^d \left( 1 - \prod_{k=1}^{d} \frac{\sin z_k}{z_k} \right) \nu^{(0)}(dz)$$

is a finite measure. Due to the Riemann-Lebesgue theorem, the function $\Psi(u)$ satisfies the following
asymptotic relation:

$$\lim_{\|u\| \to \infty} \Psi(u) = \frac{2^d}{3} \operatorname{tr}(\alpha^{(0)}) + \frac{2^d}{3} \sum_{j=1}^{d} x_j \operatorname{tr}(\alpha^{(1)}_j) =: C.$$

So, in order to reconstruct the transformed Levy measure $\rho^{(0)}$, and hence $\nu^{(0)}$, one can first estimate
the function $\Psi(u)$, then estimate its limit as $\|u\| \to \infty$, and finally the Fourier transform of $\rho^{(0)}$ using
(10).
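The way the transformation (10) removes the drift and reduces the diffusion part to a constant can be checked numerically. The sketch below is an illustration under assumptions that are not in the paper: a one-dimensional Levy process ($d = 1$, no state dependence) with diffusion coefficient `alpha`, drift `beta` and jumps of a single size `z0` arriving at rate `lam`, so that $\partial_s \psi(u|0) = -\alpha u^2 + \mathrm{i}\beta u + \lambda(e^{\mathrm{i}uz_0} - 1)$. Because the jump measure is finite, the compensating term with $\chi$ is omitted; it is linear in $u$ and would cancel in (10) just like the drift. The function names are hypothetical.

```python
import numpy as np

def ds_psi(u, alpha, beta, lam, z0):
    """Time derivative of the log-c.f. at s = 0 for the assumed toy Levy process."""
    return -alpha * u**2 + 1j * beta * u + lam * (np.exp(1j * u * z0) - 1.0)

def Psi_numeric(u, alpha, beta, lam, z0, n_nodes=40):
    """Transformation (10) for d = 1, via Gauss-Legendre quadrature on [-1, 1]."""
    nodes, weights = np.polynomial.legendre.leggauss(n_nodes)
    diff = ds_psi(u, alpha, beta, lam, z0) - ds_psi(u + nodes, alpha, beta, lam, z0)
    return np.sum(weights * diff)

def Psi_closed(u, alpha, beta, lam, z0):
    """Constant 2*alpha/3 plus the Fourier transform of rho = 2(1 - sin z/z) nu."""
    rho_mass = 2.0 * (1.0 - np.sin(z0) / z0) * lam   # rho puts this mass at z0
    return 2.0 * alpha / 3.0 + rho_mass * np.exp(1j * u * z0)

par = dict(alpha=0.3, beta=-0.2, lam=1.7, z0=0.8)
gap = abs(Psi_numeric(3.0, **par) - Psi_closed(3.0, **par))
```

Note that `beta` does not appear in `Psi_closed`: the drift integrates to zero over the symmetric cube, and the quadratic diffusion term leaves only the constant $2\alpha/3$, in agreement with the displayed representation of $\Psi$ for $d = 1$.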

Remark 3.1. The transformation (10), which transforms the Levy measure to a finite measure, is
not unique. Motivated by the results of Neumann and Reiß (2007) for the case of one-dimensional
Levy processes, one can consider the second derivative of $\partial_s \psi(u|0,x)$ with respect to $u$ instead. However, the latter type
of transformation would require existence of the second-order moments of the process $X(t)$.
Moreover, empirical results indicate that integration in (10) may reduce the variance of an
estimate for $\partial_s \psi(u|0,x)$.

Remark 3.2. If we are interested in reconstructing only one particular component of $\nu$, then it
is enough to construct the estimates for the corresponding marginal conditional c.f. and its
derivative in time. These estimates in turn can be constructed using only a time series of the
corresponding component of $X$ (see Section 6 for a numerical illustration).

In the sequel we shall assume that the measure $\rho^{(0)}$ is absolutely continuous w.r.t. Lebesgue
measure on $\mathbb{R}^d$ and possesses a bounded density denoted (with some abuse of notation) by
$\rho^{(0)}(x)$. In fact, this means that the index of jump activity of the process $X(t)$ introduced
in Aït-Sahalia and Jacod (2008) for general semimartingales is smaller than 1, i.e.

$$\max_{k=1,\ldots,d} \inf \left\{ r \ge 0 : \int_{\{|x_1| > 1, \ldots, |x_{k-1}| > 1, \, |x_k| \le 1, \, |x_{k+1}| > 1, \ldots, |x_d| > 1\}} |x_k|^r \, \nu^{(0)}(dx) < \infty \right\} < 1.$$

Note that according to the admissibility conditions in Duffie, Filipovic and Schachermayer (2003)
it must hold

$$\max_{k=1,\ldots,m} \inf \left\{ r \ge 0 : \int_{\{|x_1| > 1, \ldots, |x_{k-1}| > 1, \, |x_k| \le 1, \, |x_{k+1}| > 1, \ldots, |x_d| > 1\}} |x_k|^r \, \nu^{(0)}(dx) < \infty \right\} < 1.$$
So, our assumption puts an upper bound on the degree of activity of small jumps for the components
$X_{m+1}(t), \ldots, X_d(t)$. Namely, we consider the case of medium activity of small jumps. The case
of highly active small jumps can be treated in our framework as well. In this case, however, we
can no longer use the Fourier inversion formula for densities, since the density becomes
unbounded. But we can still employ the Fourier inversion formula for distributions instead. For
the sake of brevity we shall not pursue this possibility in this paper. Let $Q_0(v)$ and $Q_1(v)$ be
any two functions satisfying

$$Q_0(v) \le \inf_{\|u\| = v} \inf_{s \in \mathcal{S}} |\phi(u|s,x)|,$$

$$Q_1(v) \ge \sup_{\|u\| = v} \sup_{s \in \mathcal{S}} |\partial_s \psi(u|s,x)| = \sup_{\|u\| = v} \sup_{s \in \mathcal{S}} |\partial_s \phi(u|s,x) / \phi(u|s,x)|$$

with $v \in \mathbb{R}_+$. The functions $Q_0(v)$ and $Q_1(v)$ can be found if some prior bounds on the eigenvalues
of the diffusion matrix are available. The following lemma can be used to construct $Q_0(v)$ and $Q_1(v)$.

Lemma 3.3. Let $X$ be a regular affine process with admissible characteristics $\chi = (\alpha, \beta, \gamma, \nu)$
such that (4) is fulfilled. Then the following estimates hold:

$$\|u\|_2^{-2} \, |\log(\phi(u|s,x))| \lesssim \lambda_{\max}(\mathfrak{A}) \int_0^s \lambda^2_{\max}(e^{t\mathfrak{B}}) \, dt, \quad \|u\|_2 \to \infty,$$

$$\|u\|_2^{-2} \, |\partial_s \psi(u|s,x)| \lesssim \lambda_{\max}(\mathfrak{A}) \, \lambda^2_{\max}(e^{s\mathfrak{B}}), \quad \|u\|_2 \to \infty,$$

where $\mathfrak{A} := (\alpha^{(0)}_{ij})_{m+1 \le i,j \le d}$, $\mathfrak{B} := (\beta^{(1)}_{ij})_{m+1 \le i,j \le d}$ and for any matrix $a$, $\lambda_{\max}(a)$ stands for the
maximal eigenvalue of $a$.

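Let us now formulate our estimation procedure. For the reader's convenience it is separated
into several steps.

Step 1 Construct estimates for $\psi(u|s,x)$ and $\partial_s \psi(u|s,x)$ as follows:

$$\hat\psi_N(u|s,x) = \log\left( T_{q_0}[\hat\phi_N](u|s,x) \right),$$
$$\hat\psi_{s,N}(u|s,x) = T_{q_1}[\hat\phi_{s,N} / \hat\phi_N](u|s,x),$$

where

$$T_{q_0}[\hat\phi_N] = \begin{cases} 1, & |\hat\phi_N| > 1, \\ \hat\phi_N, & q_0 \le |\hat\phi_N| \le 1, \\ q_0 \, [\hat\phi_N / |\hat\phi_N|], & |\hat\phi_N| < q_0 \end{cases}$$

and

$$T_{q_1}[\hat\phi_{s,N} / \hat\phi_N] = \begin{cases} \hat\phi_{s,N} / \hat\phi_N, & |\hat\phi_{s,N}| \le q_1 |\hat\phi_N|, \\ q_1 \, [\hat\phi_{s,N} |\hat\phi_N| / (\hat\phi_N |\hat\phi_{s,N}|)], & |\hat\phi_{s,N}| > q_1 |\hat\phi_N|. \end{cases}$$

Step 2 Define

$$\hat\Psi_N(u) := \int_{[-1,1]^d} \left( \hat\psi_{s,N}(u|0,x) - \hat\psi_{s,N}(u + w|0,x) \right) dw.$$

Let $K(u)$ be a non-negative function supported on $[-1,1]$ that satisfies

$$\int K(u) \, du = 1.$$

For any $U > 0$ put

$$K^U(u) = U^{-1} K(u U^{-1})$$

and define an estimate for the limit $C$ as

$$\hat C_{U,N} := \int \left[ K^U(u_1) \times \ldots \times K^U(u_d) \right] \hat\Psi_N(u) \, du.$$

Step 3 Reconstruct the density $\rho^{(0)}(x)$ using the Fourier inversion formula

$$\hat\rho^{(0)}_N(x; U) := \frac{1}{(2\pi)^d} \int_{-U}^{U} \ldots \int_{-U}^{U} e^{-\mathrm{i} u^\top x} \left[ \hat\Psi_N(u) - \hat C_{U,N} \right] du.$$

In the next section we present uniform convergence rates for the estimate $\hat\rho^{(0)}_N(x; U)$.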
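The three steps above can be sketched in code for $d = 1$. This is a minimal illustration under stated assumptions, not the paper's implementation: $\hat\Psi_N$ is assumed to be already available on an equidistant grid `u`, the kernel is the (hypothetical) choice $K = 2 \cdot 1_{[1/2, 1]}$, which vanishes near zero, the discretized kernel is renormalized on the grid, zero values of $\hat\phi_N$ are given phase one by convention, and the real part of the inverse transform is returned. All function names are illustrative.

```python
import numpy as np

def truncate_cf(phi, q0):
    """Step 1, operator T_{q0}: 1 if |phi| > 1, q0 * phase if |phi| < q0, else phi."""
    phi = np.asarray(phi, dtype=complex)
    mod = np.abs(phi)
    phase = np.where(mod > 0, phi / np.where(mod > 0, mod, 1.0), 1.0)
    out = np.where(mod < q0, q0 * phase, phi)
    return np.where(mod > 1.0, 1.0, out)

def truncate_ratio(phi_s, phi, q1):
    """Step 1, operator T_{q1}: clip the modulus of phi_s / phi at q1, keeping the phase."""
    phi_s = np.asarray(phi_s, dtype=complex)
    phi = np.asarray(phi, dtype=complex)
    ratio = phi_s / np.where(phi != 0, phi, 1.0)      # guard against division by zero
    mod = np.abs(ratio)
    phase = np.where(mod > 0, ratio / np.where(mod > 0, mod, 1.0), 1.0)
    too_big = np.abs(phi_s) > q1 * np.abs(phi)
    return np.where(too_big, q1 * phase, ratio)

def estimate_limit(u, Psi, U):
    """Step 2: kernel-weighted average of Psi over [U/2, U] (assumed kernel)."""
    mask = (u >= 0.5 * U) & (u <= U)
    weights = mask / mask.sum()                       # renormalized on the grid
    return np.sum(weights * Psi)

def invert_density(x, u, Psi, U):
    """Step 3: truncated Fourier inversion on [-U, U] by a Riemann sum."""
    du = u[1] - u[0]
    C = estimate_limit(u, Psi, U)
    inside = np.abs(u) <= U
    integrand = np.exp(-1j * np.outer(x, u[inside])) * (Psi[inside] - C)
    return (integrand.sum(axis=1) * du / (2.0 * np.pi)).real

# Noise-free toy input: Psi = C + Fourier transform of a standard normal density.
u = np.linspace(-12.0, 12.0, 4801)
Psi = 0.7 + np.exp(-0.5 * u**2)
rho_hat = invert_density(np.array([0.0, 1.0]), u, Psi, U=10.0)
```

With noisy input the cut-off `U` has to be much smaller than in this toy example, since the estimation error of $\hat\Psi_N$ grows with $|u|$; this trade-off is what the convergence rates in the next section quantify.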

4 Theoretical properties 

First, the following lemma shows that both estimates $\hat\psi_N(u|s,x)$ and $\hat\psi_{s,N}(u|s,x)$ are uniformly
consistent.

Lemma 4.1. If the estimates $\hat\phi_N(u|s,x)$ and $\hat\phi_{s,N}(u|s,x)$ fulfill (8)-(9), then it holds for any fixed
$s \in \mathcal{S}$ and $x \in \mathcal{D}$

(12)  $$\sup_{u \in \mathbb{R}^d} \left[ w_0(\|u\|) \, |\hat\psi_N(u|s,x) - \psi(u|s,x)| \right] = O_{a.s.}(\zeta_N),$$

(13)  $$\sup_{u \in \mathbb{R}^d} \left[ w_1(\|u\|) \, |\hat\psi_{s,N}(u|s,x) - \partial_s \psi(u|s,x)| \right] = O_{a.s.}(\zeta_N),$$

where $w_0(\|u\|) = Q_0(\|u\|) \, w(\|u\|)$ and $w_1(\|u\|) = Q_0(\|u\|) \left( Q_1^{-1}(\|u\|) \wedge 1 \right) w(\|u\|)$.

Next we prove minimax upper and lower risk bounds for the estimate $\hat\rho^{(0)}_N(x; U)$.

4.1 Upper risk bounds 

For any $A > 0$, $1 < \kappa < 2$ and $R > 0$ let $\mathcal{A}(A, \kappa, R)$ stand for a class of regular affine models
with admissible characteristics $\chi = (\alpha, \beta, \gamma, \nu)$ satisfying (4) and the following conditions.

1. It holds for any $s \in \mathcal{S}$

$$\max \left\{ \lambda_{\max}(\mathfrak{A}) \int_0^s \lambda^2_{\max}(e^{t\mathfrak{B}}) \, dt, \; \lambda_{\max}(\mathfrak{A}) \, \lambda^2_{\max}(e^{s\mathfrak{B}}) \right\} \le A,$$

where the matrices $\mathfrak{A}$ and $\mathfrak{B}$ are defined in Lemma 3.3.

2. For any $u \in \mathbb{R}^d$

(14)  $$\left( \prod_{k=1}^{d} |u_k| \right)^{\kappa} \cdot \left| \mathcal{F}[\rho^{(0)}](u) \right| \le R,$$

where the measure $\rho^{(0)}$ is related to the Levy measure $\nu^{(0)}$ via (11).

Here and in the sequel $\mathcal{F}[\rho](u)$ stands for the Fourier transform of a measure $\rho$.

Remark 4.2. It can be shown that if the degree of jump activity of each component of the
process $X(t)$, measured as in Section 3.1, is less than $2 - \kappa$,
then inequality (14) holds with some $R > 0$.

Based on the above assumption and Lemma 4.1, the following proposition on the uniform
convergence of the estimate $\hat\rho^{(0)}_N(x; U)$ can be proved.

Proposition 4.3. Let $X(t)$ be a regular affine process with characteristics $\chi = (\alpha, \beta, \gamma, \nu) \in
\mathcal{A}(A, \kappa, R)$ and with a conditional characteristic function $\phi(u|s,x)$. Suppose that

(15)  $$\int |u|^{-\kappa} |K(u)| \, du < \infty.$$

If the sequences of estimates $\{\hat\phi_N(u|s,x)\}$ and $\{\hat\phi_{s,N}(u|s,x)\}$ for $\phi(u|s,x)$ and $\partial_s \phi(u|s,x)$
respectively are available and fulfill (8)-(9), then for any large enough $U$ the estimate $\hat\rho^{(0)}_N(x; U)$
satisfies

(16)  $$\sup_{x \in \mathbb{R}^d} \left| \rho^{(0)}(x) - \hat\rho^{(0)}_N(x; U) \right| = O_{a.s.}\left( \zeta_N \, w^{-1}(U) \, U^{2+d} e^{A U^2} + U^{-(\kappa - 1)} \right).$$
Discussion 

As will be shown in Section 5, one can construct estimates $\hat\phi_N(u|s,x)$ and $\hat\phi_{s,N}(u|s,x)$ such that
(8) and (9) hold with $w(v) = \min\{1, v^{-4}\}$ and $\zeta_N = \max\{\zeta_{0,N}, \zeta_{1,N}\} = O(N^{-r} \log^q N)$ for some
$r > 0$ and $q > 0$. As a result, the rates in Proposition 4.3 are logarithmic if $A > 0$. More precisely,
setting

$$U_N := A^{-1/2} \sqrt{ r \log N - \left( \frac{\kappa - 1}{2} + 3 + \frac{d}{2} + q \right) \log \log N },$$

we get as $N \to \infty$

(17)  $$\sup_{x \in \mathbb{R}^d} \left| \rho^{(0)}(x) - \hat\rho^{(0)}_N(x; U_N) \right| = O_{a.s.}\left( (\log N)^{-(\kappa - 1)/2} \right).$$

Several observations can be made from the inspection of these rates. First, the convergence
rates are logarithmic and correspond to a severely ill-posed problem. The reason for the severe
ill-posedness is that we face a deconvolution-like problem: the law of the continuous part of
$X(T)$ is convolved (in a generalized sense) with that of the jump part to give the distribution of
$X(T)$. Second, the nearer $\kappa$ is to 1, the more complex the estimation problem becomes. Since
the degree of jump activity is equal to $2 - \kappa$, this means that estimating the characteristics of
jumps becomes more difficult as the degree of jump activity increases to 1.
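To see how the cut-off $U_N$ balances the two terms in (16), the following sketch evaluates them numerically with $w(v) = \min\{1, v^{-4}\}$ and $\zeta_N = N^{-r} \log^q N$. The parameter values are arbitrary assumptions for illustration, and the function names are hypothetical. The sketch also makes visible that the formula for $U_N$ is asymptotic: for moderate $N$ the expression under the square root is negative, in which case the helper raises an error.

```python
import math

def cutoff(N, A, r, q, kappa, d):
    """Cut-off U_N from the discussion of Proposition 4.3."""
    c = (kappa - 1.0) / 2.0 + 3.0 + d / 2.0 + q
    arg = r * math.log(N) - c * math.log(math.log(N))
    if arg <= 0:
        raise ValueError("N too small: r*log N does not dominate c*log log N")
    return math.sqrt(arg / A)

def error_terms(N, A, r, q, kappa, d):
    """Stochastic and bias terms of (16) at U = U_N, assuming U_N >= 1."""
    U = cutoff(N, A, r, q, kappa, d)
    zeta = N ** (-r) * math.log(N) ** q
    stochastic = zeta * U**4 * U ** (2 + d) * math.exp(A * U * U)  # w^{-1}(U) = U^4
    bias = U ** (-(kappa - 1.0))
    return stochastic, bias

# Assumed illustrative values; N must be astronomically large for U_N to exist.
stochastic, bias = error_terms(1e60, A=1.0, r=0.2, q=0.3, kappa=1.5, d=1)
```

Both terms then decay like a negative power of $\log N$, which is the logarithmic behaviour stated in (17).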
4.2 Lower risk bounds

The rates in (17) are in fact optimal in the minimax sense for the class $\mathcal{A}(A, \kappa, R)$.

Proposition 4.4. The following minimax lower risk bounds hold:

$$\liminf_{N \to \infty} \inf_{\hat\rho} \sup_{\chi \in \mathcal{A}(A, \kappa, R)} P_\chi \left( (\log N)^{(\kappa - 1)/2} \sup_{x \in \mathbb{R}^d} |\rho^{(0)}(x) - \hat\rho(x)| > \varepsilon \right) > 0,$$

where $\varepsilon$ is some positive number and $\hat\rho$ is any estimator of $\rho^{(0)}$ based on $N$ observations.

5 Nonparametric estimation of $\phi$ and $\partial_s \phi$

In this section we present a method for estimating the conditional characteristic function $\phi(u|s,x)$
and its derivatives in time from a time series of $X(t)$. For our theoretical study we adopt the
so-called random design observational model, i.e. we assume that for some $x \in \mathcal{D}$ a trajectory of
the process $X$ containing pairs

$$(X(t_n), X(t_n + \delta_n)), \quad X(t_n) = x, \quad n = 1, \ldots, N,$$

is available, where $\delta_n$, $n = 1, \ldots, N$, are i.i.d. random variables on $[0,T]$ for some fixed $T > 0$
with a common bounded density $p_\delta(x)$ and $\min_n (t_{n+1} - t_n) > 2T$. This assumption implies that
the time horizon $T + t_N$ of observations tends to $\infty$ as $N \to \infty$, a condition which is not unusual
even in the literature on statistical inference from high-frequency data (see, e.g., Figueroa-Lopez
(2009)). In Figure 1 a typical trajectory of a one-dimensional Ornstein-Uhlenbeck process (without
jump component) is shown together with the level line $x = 0$.

Figure 1: A typical path of the one-dimensional Ornstein-Uhlenbeck process $dX(t) =
-5X(t)\,dt + 3.5\,dW(t)$ along with the line $x = 0$ (horizontal axis: time).

We estimate $\phi(u|s,x)$ and $\partial_s \phi(u|s,x)$ by a local linear smoothing of the empirical
characteristic process. Define

(18)  $$\hat\phi_N(u|s,x) := \sum_{n=1}^{N} T_{0,n}(s) \exp\left( \mathrm{i} u^\top X(t_n + \delta_n) \right),$$

(19)  $$\hat\phi_{s,N}(u|s,x) := \sum_{n=1}^{N} T_{1,n}(s) \exp\left( \mathrm{i} u^\top X(t_n + \delta_n) \right),$$

where

$$T_{j,n}(s) = \frac{b_{j,n}(s)}{\sum_{k=1}^{N} b_{0,k}(s)}, \quad j = 0, 1,$$

and

$$b_{0,n}(s) = K\left( \frac{\delta_n - s}{h} \right) \left[ S_{N,2} - (\delta_n - s) S_{N,1} \right],$$

$$b_{1,n}(s) = K\left( \frac{\delta_n - s}{h} \right) \left[ (\delta_n - s) S_{N,0} - S_{N,1} \right],$$

$$S_{N,j} = \sum_{n=1}^{N} K\left( \frac{\delta_n - s}{h} \right) (\delta_n - s)^j, \quad j = 0, 1, 2.$$

Here $K$ is a kernel and $h$ is a bandwidth. Denote

$$\Gamma_N := \frac{1}{Nh} \begin{pmatrix} S_{N,0} & h^{-1} S_{N,1} \\ h^{-1} S_{N,1} & h^{-2} S_{N,2} \end{pmatrix}, \qquad \Gamma(s) := \begin{pmatrix} \mu_0(s) & \mu_1(s) \\ \mu_1(s) & \mu_2(s) \end{pmatrix}$$

with $\mu_l(s) = \int z^l K(z) p_\delta(s + hz) \, dz$. It is easy to show that if the matrix $\Gamma_N$ is invertible then it holds that
$(\hat\phi_N, h \hat\phi_{s,N})^\top = \Gamma_N^{-1} Y$, where $Y$ is a two-dimensional vector with components

$$Y_k = \frac{1}{Nh} \sum_{n=1}^{N} \exp\left( \mathrm{i} u^\top X(t_n + \delta_n) \right) \left( \frac{\delta_n - s}{h} \right)^k K\left( \frac{\delta_n - s}{h} \right), \quad k = 0, 1.$$
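The local linear estimates (18) and (19) amount to a kernel-weighted least-squares fit of a straight line in the lag variable. The following is a minimal sketch for scalar $u$ and one-dimensional $X$; the Epanechnikov kernel and the function names are assumptions made for illustration only.

```python
import numpy as np

def epanechnikov(z):
    """A bounded kernel supported on [-1, 1] (assumed choice)."""
    return np.where(np.abs(z) <= 1.0, 0.75 * (1.0 - z**2), 0.0)

def local_linear(s, delta, Y, h, kernel=epanechnikov):
    """Local linear fit of Y against delta at the point s.

    Returns (value, slope): weighted least squares of Y on (1, delta - s).
    """
    t = delta - s
    K = kernel(t / h)
    S0, S1, S2 = K.sum(), (K * t).sum(), (K * t**2).sum()
    det = S0 * S2 - S1**2                  # equals the sum of the weights b_{0,n}
    b0 = K * (S2 - t * S1)                 # weights for the value, cf. (18)
    b1 = K * (t * S0 - S1)                 # weights for the slope, cf. (19)
    return (b0 * Y).sum() / det, (b1 * Y).sum() / det

def local_linear_cf(u, s, delta, X_end, h):
    """Estimates of phi(u|s,x) and d/ds phi(u|s,x) from the pairs (delta_n, X(t_n + delta_n))."""
    return local_linear(s, delta, np.exp(1j * u * X_end), h)

# Toy check: data that are exactly linear in the lag are reproduced without error.
delta = np.linspace(0.0, 1.0, 201)
value, slope = local_linear(0.0, delta, (2.0 + 3.0 * delta).astype(complex), h=0.3)
```

At $s = 0$ only lags $\delta_n \in [0, h]$ contribute, so the fit is one-sided; local linear smoothing is a natural choice in this boundary situation because it reproduces linear functions exactly.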



In order to be able to prove the convergence of the estimates $\hat\phi_N(u|s,x)$ and $\hat\phi_{s,N}(u|s,x)$ we need
the following assumptions.

Assumptions

(AX1) The sequence $X_n := X(t_n + \delta_n) - X(t_n)$ is exponentially strongly mixing, i.e. the mixing
coefficients $\alpha_X$ (see Appendix for definition) satisfy

$$\alpha_X(n) \le \bar\alpha \exp(-cn), \quad n \ge 1,$$

for some $\bar\alpha > 0$ and $c > 0$.

(AX2) The Levy measure $\nu^{(0)}$ satisfies for some $p > 2$

$$\int_{\{\|x\| > 1\}} \|x\|^p \, \nu^{(0)}(dx) < \infty.$$

(AS) The minimal eigenvalue of the matrix $\Gamma(s)$ is bounded from below by some positive
constant $\gamma_0$ uniformly in $s$, i.e.

$$\min_{s \in \mathcal{S}} \lambda_{\min}(\Gamma(s)) > \gamma_0 > 0.$$

(AK) The kernel $K$ is bounded and has support in $[-1, 1]$.

Remark 5.1. Exponential strong mixing holds for a wide class of Ito-Levy processes. Masuda
formulates conditions which ensure that a multidimensional Ito-Levy process
is exponentially $\beta$-mixing (hence exponentially $\alpha$-mixing). Suppose that $X$ is a regular
affine process with characteristics $\chi$ satisfying the admissibility conditions, the condition (4) and
$\int_{\{|x| > 1\}} |x|^q \, \nu^{(0)}(dx) < \infty$ for some $q > 0$. Moreover, assume that
Pk}=^ 4m = °> i,kei:={i,...,d}\i, 

where $\mathcal{I}$ is a subset of $\{1, \ldots, d\}$. This situation is typical for stochastic volatility models. If the
maximal eigenvalue of the matrix $(\beta^{(1)}_{ij})_{i,j \in \mathcal{I}}$ is negative, then both sequences $X_{\mathcal{I}}(t_n + \delta_n)$ and
$X_{\mathcal{J}}(t_n + \delta_n) - X_{\mathcal{J}}(t_n)$ are exponentially $\beta$-mixing and ergodic. Taking the initial distribution for
$X_{\mathcal{I}}(0)$ to be the invariant one, both sequences $X_{\mathcal{I}}(t_n + \delta_n)$ and $X_{\mathcal{J}}(t_n + \delta_n) - X_{\mathcal{J}}(t_n)$ become
stationary.

Remark 5.2. Assumption (AS) imposes some regularity conditions on the design $\{\delta_n\}_{n=1}^{N}$. In
particular, it ensures that points arbitrarily close to 0 appear in the sample with positive probability
as $N \to \infty$. This in turn allows us to consistently estimate the derivative $\partial_s \phi(u|s,x)|_{s=0}$.

5.1 Asymptotic properties of $\hat\phi_N$ and $\hat\phi_{s,N}$

In this section we investigate the asymptotic properties of the estimates $\hat\phi_N$ and $\hat\phi_{s,N}$. Consider
truncated versions $\tilde\phi_N(u)$ and $\tilde\phi_{s,N}(u)$ of the estimates $\hat\phi_N(u)$ and $\hat\phi_{s,N}(u)$ respectively, which are defined
as follows. If the smallest eigenvalue of the matrix $\Gamma_N$ is greater than $\gamma_0/2$, we set $\tilde\phi_N(u) = \hat\phi_N(u)$
and $\tilde\phi_{s,N}(u) = \hat\phi_{s,N}(u)$. Otherwise, we put $\tilde\phi_N(u) = \tilde\phi_{s,N}(u) = 0$. The following proposition
shows that both estimates $\tilde\phi_N$ and $\tilde\phi_{s,N}$ are uniformly consistent in a weighted sup-norm.

Proposition 5.3. Suppose that assumptions (AX1), (AX2), (AS) and (AK) are fulfilled and
the bandwidth $h_N$ satisfies $h_N^{-1} = o(N)$. Let $w$ be a positive Lipschitz function on $\mathbb{R}_+$ such that

(20)  $$0 < w(z) \le \min\{1, z^{-4}\}, \quad z \in \mathbb{R}_+.$$

Then for any fixed $s \in \mathcal{S}$ and $x \in \mathcal{D}$

(21)  $$\sup_{u \in \mathbb{R}^d} \left[ w(\|u\|) \, |\tilde\phi_N(u|s,x) - \phi(u|s,x)| \right] = O_{a.s.}\left( \sqrt{\frac{\log^{1+r} N}{N h_N}} + h_N^2 \right),$$

(22)  $$\sup_{u \in \mathbb{R}^d} \left[ w(\|u\|) \, |\tilde\phi_{s,N}(u|s,x) - \partial_s \phi(u|s,x)| \right] = O_{a.s.}\left( \sqrt{\frac{\log^{1+r} N}{N h_N^3}} + h_N \right)$$

with some $r > 0$.
Discussion The right-hand sides of (21) and (22) are sums of two terms corresponding to
"variance" and "bias" respectively. The optimal choice of the bandwidth parameter which
minimizes (21) and (22) is given by

$$h_N = \left( N^{-1} \log^{1+r} N \right)^{1/5},$$

in which case the uniform convergence rates are given by

$$\sup_{u \in \mathbb{R}^d} \left[ w(\|u\|) \, |\tilde\phi_N(u|s,x) - \phi(u|s,x)| \right] = O_{a.s.}\left( \left( N^{-1} \log^{1+r} N \right)^{2/5} \right),$$

$$\sup_{u \in \mathbb{R}^d} \left[ w(\|u\|) \, |\tilde\phi_{s,N}(u|s,x) - \partial_s \phi(u|s,x)| \right] = O_{a.s.}\left( \left( N^{-1} \log^{1+r} N \right)^{1/5} \right).$$

Remark 5.4. The condition (20) on the decay of the weighting function $w$ cannot, in general, be
weakened. For example, in the case of a one-dimensional Brownian motion with volatility $\sigma^2$,
the simplest affine process, we get $\partial_{ss} \phi(u|s,x)|_{s=0} = \sigma^4 u^4 / 4$. This means that the approximation
errors of the local linear estimates in (18) and (19) at the point $s = 0$ are of order $h^2 \sigma^4 u^4 / 8$ and $h \sigma^4 u^4 / 8$
respectively. So, in order to be able to prove uniform consistency in $u$, we have to assume (20).
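For completeness, the second time derivative quoted in the Brownian example can be worked out directly. The short derivation below assumes $X(s) = x + \sigma W(s)$, so that the conditional characteristic function is Gaussian; the modulus is taken at $s = 0$.

```latex
\begin{align*}
\phi(u|s,x) &= \exp\!\left( \mathrm{i} u x - \tfrac{1}{2} \sigma^2 u^2 s \right), \\
\partial_s \phi(u|s,x) &= -\tfrac{1}{2} \sigma^2 u^2 \, \phi(u|s,x), \\
\partial_{ss} \phi(u|s,x) &= \tfrac{1}{4} \sigma^4 u^4 \, \phi(u|s,x), \\
\left| \partial_{ss} \phi(u|s,x) \right|_{s=0} &= \tfrac{1}{4} \sigma^4 u^4 .
\end{align*}
```

Since the bias of a local linear fit is driven by this second derivative, it grows like $u^4$, and a weight decaying at least like $z^{-4}$ is what keeps the weighted error bounded uniformly in $u$.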



6 Numerical example 

Let us consider a class of stochastic volatility models of the type

(23)  $$dX(t) = -\tfrac{1}{2} V(t) \, dt + \sqrt{V(t)} \, dW^s(t) + dZ_t,$$
      $$dV(t) = \lambda (\theta - V(t)) \, dt + \zeta \sqrt{V(t)} \, dW^v(t),$$

where $X(t) = \log(S(t))$ is the log-price process, $W^s$ and $W^v$ are two independent Brownian
motions, $\lambda$, $\theta$, $\zeta$ are positive constants and $Z_t$ is a pure-jump Levy process with Levy density
$\nu(x)$. This is a special type of the model introduced in Bates (2005). In our numerical example
we take $Z(t)$ to be an $\alpha$-stable Levy process with stability index $\alpha < 1$, i.e.

$$\nu(x) = \frac{C}{|x|^{1+\alpha}}, \qquad \rho(x) = 2 \left( 1 - \frac{\sin x}{x} \right) \nu(x)$$

for some constant $C > 0$. For the sake of simplicity we consider a fixed design and simulate a
set of i.i.d. pairs

(24)  $$(X^{(n)}(0), X^{(n)}(\Delta)), \quad n = 1, \ldots, N,$$

with some fixed $\Delta > 0$, where

$$(X^{(1)}(0), V^{(1)}(0)) = \ldots = (X^{(N)}(0), V^{(N)}(0)).$$

Without loss of generality we set $(X^{(n)}(0), V^{(n)}(0)) = (0, 1)$. Our aim is to reconstruct $\rho$ using
the sample (24). First compute

$$\hat\psi_{s,N}(u|0) = \hat\phi_{s,N}(u|0) := \frac{1}{N\Delta} \sum_{n=1}^{N} \left[ \exp\left( \mathrm{i} u X^{(n)}(\Delta) \right) - \exp\left( \mathrm{i} u X^{(n)}(0) \right) \right]$$

and

$$\hat\Psi_N(u) := \int_{-1}^{1} \left( \hat\psi_{s,N}(u|0) - \hat\psi_{s,N}(u + w|0) \right) dw.$$

Remark 6.1. In the case of high-frequency data, observations are usually available for different
frequency scales $\Delta$, and the choice of an appropriate frequency for the estimation procedure should
be made depending on $N$, the number of points available for the given frequency scale. If $\Delta$ is
too small, then the variance of $\hat\phi_{s,N}(u|0)$ explodes. On the other hand, if $\Delta$ is too large, then the
approximation error of $\hat\phi_{s,N}(u|0)$ becomes large.
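The two quantities computed from the sample (24) can be sketched as follows. This is an illustration only: the function names are hypothetical, the inner integral over $w \in [-1, 1]$ is evaluated by Gauss-Legendre quadrature (an assumed numerical choice), and the simulation of the pairs themselves is not shown.

```python
import numpy as np

def phi_s_hat(u, X0, X_delta, delta):
    """Finite-difference estimate of d/ds phi(u|s)|_{s=0} from pairs (X(0), X(delta))."""
    u = np.atleast_1d(u).astype(float)
    diff = np.exp(1j * np.outer(u, X_delta)) - np.exp(1j * np.outer(u, X0))
    return diff.mean(axis=1) / delta

def Psi_hat(u, X0, X_delta, delta, n_nodes=20):
    """Empirical counterpart of (10) for d = 1."""
    nodes, weights = np.polynomial.legendre.leggauss(n_nodes)
    u = np.atleast_1d(u).astype(float)
    base = phi_s_hat(u, X0, X_delta, delta)
    grid = (u[:, None] + nodes[None, :]).ravel()
    shifted = phi_s_hat(grid, X0, X_delta, delta).reshape(len(u), n_nodes)
    # integral of (base - shifted) over w in [-1, 1]
    return 2.0 * base - shifted @ weights

# Degenerate toy sample (all increments equal), used only to exercise the code.
X0 = np.zeros(4)
X_delta = np.full(4, 0.2)
values = Psi_hat(np.array([0.5, 3.0]), X0, X_delta, delta=0.1)
```

Dividing by `delta` is what makes the variance grow as the sampling interval shrinks, in line with Remark 6.1.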

Next define a parametric family of functions

$$\hat\rho_N(x; U) := \frac{1}{2\pi} \int_{-U}^{U} e^{-\mathrm{i} u x} \left[ \hat\Psi_N(u) - \hat\Psi_N(U) \right] du$$

and find $U$ by solving the following minimization problem:

$$\hat U = \operatorname*{arginf}_{U} \left\{ \int_{\{|u| > U\}} \left| \hat\Psi_N(u) - \hat\Psi_N(U) \right|^2 du + \pi \int |\partial_{xx} \hat\rho_N(x; U)| \, dx \right\},$$

where $\pi > 0$ is a regularization parameter. In fact, this approach for choosing $U$ employs
additional information about the smoothness of $\rho$ and turns out to be rather efficient in practice. In
Figure 2 typical results of estimation based on $N = 1000$ samples (24) with $\Delta = 0.1$ are shown
for two specifications of the process $Z_t$. As can be seen, the overall quality of estimation is good,
taking into account the severe ill-posedness of the underlying estimation problem. However, the
behavior of the transformed Levy density $\rho$ at zero is not captured by the estimation method. In
order to correct $\hat\rho(x)$ at $x = 0$ we separately estimate the stability index $\alpha$ using a modification of
the spectral algorithm proposed in Belomestny for Levy processes.

Figure 2: Typical estimates for the transformed Levy density $\rho$ (dashed black line) together with
the true $\rho$ (solid red line) in the Bates stochastic volatility model with symmetric stable process
$Z_t$ and different stability indexes $\alpha$.

Motivated by relations (2) and (3), we define for any $\alpha \in (0, 1)$



$$Q(\alpha) := \min_{(l_0, l_1, l_2, l_3)}\int_0^{U}\big(\psi_{s,N}(u|0) - l_3u^{\alpha} - l_2u^2 - l_1u - l_0\big)^2\,du \qquad (25)$$



and estimate $\alpha$ via $\hat\alpha := \operatorname{argmin}_{\alpha\in(0,1)}Q(\alpha)$. Figure 3 shows the functions $Q(\alpha)$ based on the same
samples as in Figure 2. The resulting estimates for $\alpha$ are $\hat\alpha = 0.451$ and $\hat\alpha = 0.783$, respectively.

Figure 3: Function $Q(\alpha)$ in the case of a symmetric stable process $Z_t$ with stability indexes 0.5
(left) and 0.8 (right), respectively.

Now we correct the estimate $\rho_N(x; U)$ by setting

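$$\rho^{c}_N(x; U) := \begin{cases} c(\varepsilon)\left(1 - \dfrac{\sin x}{x}\right)|x|^{-(1+\hat\alpha)}, & |x| \le \varepsilon, \\ \rho_N(x; U), & |x| > \varepsilon, \end{cases}$$

where for any $\varepsilon > 0$ the constant $c(\varepsilon)$ is chosen in such a way that the function $\rho^{c}_N(x; U)$ is
continuous. Finally, we find a small enough $\varepsilon > 0$ which minimizes the integral $\int|\partial_{xx}\rho^{c}_N(x; U)|\,dx$.
Here again the smoothness of $\rho$ is used. A corrected estimate $\rho^{c}_N(x; U)$ is shown in Figure 4.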
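
The stability-index step around (25) is a one-dimensional search: for each candidate index, an exponent is regressed by least squares on a power term plus a quadratic, a linear and a constant term, and the index with the smallest residual is kept. The sketch below illustrates this under stated assumptions: the exponent is synthetic and noise-free, the integral is replaced by a Riemann sum on a uniform grid, and the names `q_alpha` and `estimate_alpha` are hypothetical.

```python
import numpy as np

def q_alpha(alpha, u, psi):
    """Least-squares residual of psi(u) on the basis {u^alpha, u^2, u, 1}."""
    design = np.column_stack([u**alpha, u**2, u, np.ones_like(u)])
    coef, *_ = np.linalg.lstsq(design, psi, rcond=None)
    resid = psi - design @ coef
    # Riemann-sum approximation of the integrated squared residual.
    return float(np.sum(resid**2) * (u[1] - u[0]))

def estimate_alpha(u, psi, alpha_grid):
    """Return the grid point minimizing the residual, and all residuals."""
    values = np.array([q_alpha(a, u, psi) for a in alpha_grid])
    return float(alpha_grid[np.argmin(values)]), values

# Synthetic exponent (assumption): stable part, diffusion part and drift.
u = np.linspace(0.05, 5.0, 200)
true_alpha, eta, sigma, mu = 0.5, 1.0, 0.3, 0.1
psi = -eta * u**true_alpha - 0.5 * sigma**2 * u**2 + mu * u

alpha_grid = np.linspace(0.05, 0.95, 91)
alpha_hat, q_values = estimate_alpha(u, psi, alpha_grid)
```

With noisy input the residual curve is flatter near its minimum, which is consistent with the estimates 0.451 and 0.783 reported above deviating slightly from the true indexes.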

7 Proofs 

7.1 Proof of Lemma 2.2

Due to © all derivatives of functions Fq{z) and F\(z) up to order k exist for Hez E x {0} x 
. . . x {0}. Fix some s G S and x G V. Using (J2J) and Q, we get for 1 < I < k + 1 

(26) g^Mfl^ = (F (^i(n, S )),F (1) (Vi(n, S )), . . . , F^HM^ «)). 



Fi^^^Ff'^^s)),...,^ ^(^(u.sJ^Ha.a;), 
Figure 4: Corrected estimates of the transformed Lévy density $\rho$ (dashed black line) together with
the true $\rho$ (solid red line) in the Bates stochastic volatility model with symmetric stable process
$Z_t$ and different stability indexes $\alpha$ (panels: $\alpha = 0.5$ and $\alpha = 0.8$).

where H is a polynomial of order I in M 21 . Note that under admissibility conditions and 
functions Fq and F\ simplify to 

d „ 

F (z) = Yl ^^j + iz,^-^ ^ J v (e zTu -l-(x(u),z)) u®(du) 



i,j=m+l 



Fu(z) = {<xfz,z)+Y j Z i fif}- 1 f\ j = l,...,m, 

i=l 



d 

F ij( z ) = j = m + l,...,d. 

i=m+l 



Solving the system of linear equations

$$\partial_s\psi_{1,j}(u,s) = F_{1,j}(\psi_1(u,s)), \quad \psi_{1,j}(u,0) = iu_j, \quad j = m+1,\ldots,d,$$

we get $(\psi_{1,m+1}(u,s),\ldots,\psi_{1,d}(u,s))^\top = e^{s\mathcal{B}}(iu_{m+1},\ldots,iu_d)^\top$. Furthermore, it follows from the
theory of Riccati ODEs that the solution of the system

$$\partial_s\psi_{1,j}(u,s) = F_{1,j}(\psi_1(u,s)), \quad \psi_{1,j}(u,0) = iu_j$$

can have at most linear growth in $u$. Since

$$\max\big\{\|F_1^{(l)}(z)\|, |F_0^{(l)}(z)|\big\} \lesssim \|z\|_2^2, \quad \|z\|_2\to\infty, \quad l = 0,\ldots,k,$$

it holds

$$\max\big\{\|F_1^{(l)}(\psi_1(u,s))\|, |F_0^{(l)}(\psi_1(u,s))|\big\} \lesssim \|\psi_1(u,s)\|_2^2 \lesssim \|u\|_2^2, \quad \|u\|_2\to\infty, \quad l = 0,\ldots,k+1.$$

Combining the last inequality with (26), we get (7).






7.2 Proof of Proposition 3.3

The analysis of the system of Riccati ODEs (2) and (3) performed in the proof of Lemma 2.2
leads to the bounds

Re[^,(«,a)] = / Re[F (Mu,t))]dt > AL«(Ql) / A max (e i!B ) dt, 



Re[ipi(u, s)] = o ( 1 1 1 1 , 1 1 ti 1 1 2 — >■ 00. 



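7.3 Proof of Lemma 4.1

We prove only the more involved bound (13). Since

$$\partial_s\phi - \psi_{s,N}\phi_N = [\partial_s\psi]\phi - \psi_{s,N}\phi_N = (\partial_s\psi - \psi_{s,N})\phi + \psi_{s,N}(\phi - \phi_N),$$

we have

$$\partial_s\psi - \psi_{s,N} = \phi^{-1}(\partial_s\phi - \psi_{s,N}\phi_N) - \phi^{-1}\psi_{s,N}(\phi - \phi_N)$$

and hence

$$|\partial_s\psi - \psi_{s,N}| \le \varrho_0^{-1}\big[|\partial_s\phi - \psi_{s,N}\phi_N| + \varrho_1|\phi - \phi_N|\big]. \qquad (27)$$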

Furthermore, it holds 



,N ~ ^s,n4>n\ < 



4>s,N ~ Qi\</>n\ 



4>s,N 



y s,N 



l(|<?i> s ,Jv|/|<?W| > Ql) 



< 



,n\ ~ Ql\(f>N\ l{\<t>s,N\/\4>N\ > Ql) 



W rrH<wv 



(28) < \a 8 ili\\<f>-<f> N \ + \<f> atN -a 8 <l>\. 

Combining (27) and (28), we get

$$|\partial_s\psi - \psi_{s,N}| \le \varrho_0^{-1}(u)\big[2\varrho_1(u)|\phi - \phi_N| + |\phi_{s,N} - \partial_s\phi|\big].$$
Now the assumptions (JHJ) and ([9]) imply 



(29) 



sup sup 



l«l(||u||)|^ s N (u\s,x) - d s 4>(u\s,x)\ = O a .s.(0v)- 



with wi(u) = qq{u){2 1 g 1 (u) A l)w(u). 
7.4 Proof of Proposition 4.3

Lemma 4.1 implies



sup 



W1 (\\U\\)\* N (U) -*(U)\ = a . s ((N) 



Since 



(30) 



-U,N 



[JC u ( Ul ) x ... x K u (u d )) \ V N (u) - *(u)J du 
+ / [K. u (ui) x ... x JC u (u d )] -F[p (0) ](u) du 



and w\{z) is monotone decreasing, we have for any U > 
-C\< w^(U) [ [JC u ( Ul ) x ... x JC u (u d )} 



Then conditions (14) and (15) imply



+ 



du. 



du < 



[K u (u 1 )x...xlC u (u d )] T[p^](u) 

R [ {ml'^IC 17 (ui)\ x ... x \u d \- x \KF (u d )\ du < CU' 



(31) 



with some constant $C > 0$. Combining (31) with (30), we get

l^.iv - £| = O a . s (V'^Kiv + U~ dH 
Furthermore, using the Fourier inversion formula, we get 



dx 



S u V \p^{x)-^\x;U)\ < U a 



+ 



sup 

\\u\\<U 



+ \£u.n - C\ 



\u\\>U} 



Vx F[p^]{u)du 



< u° 



w^(\\U\\) sup 

\\u\\<U 



wA u 



+ \£u,n — £\ 



Recalling the definition of $w_1$ (see Lemma 4.1) and using Lemma 2.2, we get the claim.

7.5 Proof of Proposition 4.4

In order to prove minimax lower bounds we apply general results from Tsybakov. Let $\Theta$
be a semi-parametric class of models. Consider a family $\{P_\theta, \theta\in\Theta\}$ of probability measures,
indexed by $\theta$. For any $\theta_1, \theta_2\in\Theta$ let $d(\theta_1, \theta_2)$ be a semi-distance between the two models $\theta_1$ and $\theta_2$.

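Lemma 7.1. Suppose that $\Theta$ contains two elements $\theta_1$ and $\theta_2$ such that $d(\theta_1,\theta_2) > 2s$ for some
$s > 0$ and $\chi^2(P_{\theta_1}^{\otimes N}, P_{\theta_2}^{\otimes N}) \le r < 1/2$, where

$$\chi^2(P,Q) := \begin{cases}\displaystyle\int\left(\frac{dP}{dQ} - 1\right)^2dQ, & \text{if } P \ll Q, \\ +\infty, & \text{otherwise} \end{cases}$$

for any two measures $P$ and $Q$. Then

$$\inf_{\hat\theta}\sup_{\theta\in\Theta}P_\theta\big(d(\hat\theta,\theta) > s\big) \ge c(r) > 0,$$

where $c(r)$ is a constant depending on $r$ and the infimum is taken over all estimates $\hat\theta$ of $\theta$ based on
$N$ observations under $P_\theta$.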
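
To make the $\chi^2$ condition of Lemma 7.1 concrete, the following sketch evaluates the divergence numerically for two unit-variance normal laws, compares it with the known closed form $e^{\mu^2} - 1$ for a mean shift $\mu$, and then passes to product measures through the standard identity $\chi^2(P^{\otimes N}, Q^{\otimes N}) = (1 + \chi^2(P,Q))^N - 1$. The Gaussian example, the grid and the function names are illustrative assumptions and are unrelated to the Lévy models constructed below.

```python
import numpy as np

def chi2_divergence(p, q, dx):
    """Riemann-sum approximation of chi^2(P, Q) = int (p/q - 1)^2 q dx."""
    return float(np.sum((p / q - 1.0) ** 2 * q) * dx)

def normal_pdf(x, mean):
    """Density of the normal law with the given mean and unit variance."""
    return np.exp(-0.5 * (x - mean) ** 2) / np.sqrt(2.0 * np.pi)

x = np.linspace(-12.0, 12.0, 24001)
dx = x[1] - x[0]
shift = 0.05  # hypothetical separation between the two models

chi2_one = chi2_divergence(normal_pdf(x, shift), normal_pdf(x, 0.0), dx)
chi2_closed_form = np.exp(shift**2) - 1.0  # exact value for this Gaussian pair

n_obs = 100
# chi^2 divergence between the N-fold product measures.
chi2_product = (1.0 + chi2_one) ** n_obs - 1.0
```

For small one-sample divergences the product value is close to `n_obs * chi2_one`, so keeping that product below 1/2 is what forces the two models to become closer as the sample size grows.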

Turn now to the construction of models 9\ and 02 from the class *4(A, x, R). Let us consider 
a symmetric stable Lévy model with a nonzero diffusion part ($\sigma > 0$)

$$\psi(u) = i\mu u - \sigma^2u^2/2 + \vartheta(u), \quad \vartheta(u) = -\eta|u|^{\alpha}, \quad 0 < \alpha < 1, \quad \mu\in\mathbb{R}.$$

For any $\delta$ satisfying $0 < \delta < \alpha$ and $M > 0$ define

$$\psi_\delta(u) = i\mu u - \sigma^2u^2/2 + \vartheta_\delta(u),$$

with

$$\vartheta_\delta(u) = -\eta|u|^{\alpha}1_{\{|u|\le M\}} - \eta M^{\delta}|u|^{\alpha-\delta}1_{\{|u|>M\}}.$$

Then $\phi_\delta(u) = \exp(\psi_\delta(u))$ is a characteristic function of some Lévy process and

$$\phi_\delta(u) = \phi(u), \quad |u| \le M,$$

where $\phi(u) = \exp(\psi(u))$. Indeed, $\vartheta_\delta(u)$ is a continuous, non-positive, symmetric function which is
convex on $\mathbb{R}_+$ for large enough $M$. According to the well known Pólya criterion (see e.g. Ushakov
(1999)), the function $\exp(\xi\vartheta_\delta(u))$ is the c.f. of some absolutely continuous distribution for any
$\xi > 0$. In particular, for any natural $n$ the function $\exp(\vartheta_\delta(u)/n)$ is the c.f. of some absolutely
continuous distribution. Hence, $\exp(\vartheta_\delta(u))$ is the c.f. of some infinitely divisible distribution.
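
The Pólya-type properties used here can be checked numerically on a finite grid. The sketch below is an illustration under assumed parameter values, with hypothetical function names: it builds the modified exponent and verifies that its exponential equals one at the origin, is non-increasing and has non-negative second differences on $[0, 50]$. A Gaussian characteristic function, which is not convex near the origin, serves as a negative control. A finite grid cannot verify the decay to zero at infinity, so this is a sanity check and not a proof.

```python
import numpy as np

def theta_delta(u, eta, alpha, delta, big_m):
    """-eta|u|^alpha for |u| <= M and -eta M^delta |u|^(alpha-delta) for |u| > M."""
    a = np.abs(np.asarray(u, dtype=float))
    inner = -eta * a**alpha
    outer = -eta * big_m**delta * a ** (alpha - delta)
    return np.where(a <= big_m, inner, outer)

def satisfies_polya_on_grid(values, tol=1e-12):
    """Check value 1 at the origin, monotone decrease and convexity on a uniform grid."""
    second_diff = values[:-2] - 2.0 * values[1:-1] + values[2:]
    return bool(
        abs(values[0] - 1.0) < tol
        and np.all(np.diff(values) <= tol)
        and np.all(second_diff >= -tol)
    )

# Illustrative parameter values (assumptions), with 0 < delta < alpha < 1.
u = np.linspace(0.0, 50.0, 5001)
phi_mod = np.exp(theta_delta(u, eta=1.0, alpha=0.8, delta=0.3, big_m=5.0))
phi_gauss = np.exp(-0.5 * u**2)  # negative control: concave near the origin

polya_mod = satisfies_polya_on_grid(phi_mod)
polya_gauss = satisfies_polya_on_grid(phi_gauss)
```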
Define now two affine (in fact, Lévy) models $\theta_1$ and $\theta_2$ corresponding to the characteristic
exponents $\psi$ and $\psi_\delta$ respectively. Let $\nu_{\theta_1}$ and $\nu_{\theta_2}$ be the corresponding Lévy measures. It holds

x \P®\Pf») = N X \ Pe „ P62 ) = n f ^M-miv)? ^ 

where $p_{\theta_1}$ and $p_{\theta_2}$ are the densities corresponding to the c.f. $\phi_{\theta_1}$ and $\phi_{\theta_2}$, respectively. Using the
asymptotic inequality

p$Ay) Z \y\" (a+1 \ \y\^oc 
and the fact that the density of a stable law does not vanish on any compact set in $\mathbb{R}$, we derive



N X (Wi.Pfla) < Nd \ P eM ~PeM\ dy 

'\v\<A 



+NC 2 [ \y\ a+1 \ Pei (y) -PeM\ 2 dy = Ndh + NCih 

J\v\>A 



'\y\>A 

for large enough $A > 0$ and some constants $C_1, C_2 > 0$. Parseval's identity implies
h < — [ \(f> 9l (u) - 4> e2 (u)\ 2 du 



< 



2tt 



oo 



|u|>M 



2vr 



< 



u|>M 



2 "^ 2 H 2 dn<Me- CT2M2 , M -> oo. 



|u|>M 



The choice $M \asymp [\sigma^{-2}\log(N\log^{\beta}N)]^{1/2}$ with some $\beta > 0$ yields

$$N\chi^2(P_{\theta_1}, P_{\theta_2}) \le 1/2$$

for large enough $N$. On the other hand



$(u) - & s (u) = -rj I [|u| a l { | M | >M} -\u + w\ a l { \ u+w \ >M} ] dw 

"1 r 



\u\ a -°l 



{\u\>M} 



\u + w\ a - d l 



{\u+w\>M} 



dw 



with 



#(u) := J [&(u) -&(u + w)] dw, 
$ s (u) := J [d s (u) -0 s (u + w)] dw. 



Using the identity 

(32) J ^ [|«| a l{| u | >M } ~\u + w\ a l{\ u +w\>M}] dw 



u 



-1 



\l-\l + w/u\ a ] dw = 2^ 



' a \ \u\°- 2k 



k=l 



2k J 2k + 1 



which holds for any \u\ > M + 1 and M > 1, we get 



- du 



du > M Q_1 , M -> oo 
for any $0 < \alpha < 1$. Denote

P6l (x) := [ e iux $(u)du = ( 1 - 
p d2 (x) := f e ux V s {u)du= (1 

then the Fourier inversion formula implies that 



X 

sin a; 

x 



sup\p dl (x) - pe 2 (x)\ > 



[$(u) -$ s (u)]du 



> M , M^oo. 



Asymptotic expansion (32) shows that there is a constant $R$ depending on $\eta$ such that

$$|u|^{2-\alpha}\Psi(u) \le R, \quad |u|^{2-\alpha}\Psi_\delta(u) \le R, \quad u\in\mathbb{R}.$$
Hence, taking A = er 2 /2, x = 2 — a, we conclude that both models 9\ and 9 2 are in .4(A, x, -R). 

7.6 Proof of Proposition 5.3

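The smallest eigenvalue $\lambda_{\min}(\Gamma)$ of the matrix $\Gamma$ satisfies

$$\lambda_{\min}(\Gamma) = \min_{\|W\|=1}W^\top\Gamma W \ge \min_{\|W\|=1}W^\top\bar\Gamma W + \min_{\|W\|=1}W^\top(\Gamma - \bar\Gamma)W \ge \min_{\|W\|=1}W^\top\bar\Gamma W - \sum_{1\le k,l\le 2}|\Gamma_{k,l} - \bar\Gamma_{k,l}|. \qquad (33)$$

According to the assumption (AS) it holds $\min_{\|W\|=1}W^\top\bar\Gamma W \ge \gamma_0$. Fix some $s > 0$, $k, l\in\{0, 1\}$
and denote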



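$$\Delta_n := \frac{1}{h_N}\left(\frac{\delta_n - s}{h_N}\right)^{k+l}K\left(\frac{\delta_n - s}{h_N}\right) - \int z^{k+l}K(z)\,p_\delta(s + h_Nz)\,dz.$$

We have $\mathrm{E}[\Delta_n] = 0$,

$$|\Delta_n| \le h_N^{-1}\sup_{z\in\mathbb{R}}\big[(1 + |z|^2)K(z)\big] =: D_1h_N^{-1}$$

and

$$\mathrm{E}[\Delta_n^2] \le \frac{1}{h_N}\int z^{2l+2k}K^2(z)\,p_\delta(s + h_Nz)\,dz \le \frac{p_{\max}}{h_N}\int_{\mathbb{R}}(1 + |z|^4)K^2(z)\,dz =: D_2h_N^{-1}$$

with $p_{\max} = \max_{s\in S}p_\delta(s)$ and some positive constants $D_1$ and $D_2$. Hence, due to the Bernstein
inequality we get for any $\delta > 0$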



$$P\big(|\Gamma_{k,l} - \bar\Gamma_{k,l}| > \delta\big) = P\left(\left|\frac{1}{N}\sum_{n=1}^{N}\Delta_n\right| > \delta\right) \le L\exp(-\delta^2BNh_N) \qquad (34)$$

with some positive constants $L$ and $B$. Combining (33) and (34), we get

$$P\big(\lambda_{\min}(\Gamma) < \gamma_0/2\big) \le 4L\exp(-\gamma_0^2BNh_N/4). \qquad (35)$$
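
The eigenvalue bound concerns a small matrix of kernel-weighted moments of the observation times. As a hedged illustration, the sketch below assumes the usual local-linear form of such a matrix, with entries $(Nh)^{-1}\sum_n z_n^{k+l}K(z_n)$ and $z_n = (t_n - s)/h$. This form, the Gaussian kernel, the uniform observation times and the function names are assumptions made for the example; they are not taken from the definition of the matrix in the paper. The sketch computes the matrix and its smallest eigenvalue, which stays well away from zero when the time points are dense around $s$.

```python
import numpy as np

def local_moment_matrix(times, s, h, kernel):
    """2x2 matrix with entries (1/(N h)) * sum_n z_n^(k+l) K(z_n), z_n = (t_n - s)/h."""
    z = (np.asarray(times, dtype=float) - s) / h
    weights = kernel(z) / (len(z) * h)
    return np.array(
        [[np.sum(weights * z ** (k + l)) for l in range(2)] for k in range(2)]
    )

def gaussian_kernel(z):
    """Standard normal density used as smoothing kernel (an assumption)."""
    return np.exp(-0.5 * z**2) / np.sqrt(2.0 * np.pi)

rng = np.random.default_rng(1)
times = rng.uniform(0.0, 1.0, size=5000)  # hypothetical observation times
gamma_hat = local_moment_matrix(times, s=0.5, h=0.1, kernel=gaussian_kernel)
lambda_min = float(np.linalg.eigvalsh(gamma_hat).min())
```

Shrinking `h` with the sample size held fixed leaves fewer time points under the kernel, so the entries fluctuate more, in line with the factor $Nh_N$ in the exponent of the probability bound.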
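Due to the definition of the estimates $\phi_N(u)$ and $\phi_{s,N}(u)$,

$$P\big(w(\|u\|)|\phi_N(u) - \phi(u)| > \delta\big) \le P\big(\lambda_{\min}(\Gamma) < \gamma_0/2\big) + P\big(w(\|u\|)|\phi_N(u) - \phi(u)| > \delta,\ \lambda_{\min}(\Gamma) \ge \gamma_0/2\big), \qquad (36)$$

$$P\big(w(\|u\|)|\phi_{s,N}(u) - \partial_s\phi(u)| > \delta\big) \le P\big(\lambda_{\min}(\Gamma) < \gamma_0/2\big) + P\big(w(\|u\|)|\phi_{s,N}(u) - \partial_s\phi(u)| > \delta,\ \lambda_{\min}(\Gamma) \ge \gamma_0/2\big). \qquad (37)$$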

Furthermore, the following representation holds on the set {A m i n (r) > 7o/2} 

(4> N (u\s,x) - (f>(u\s,x),h N ((f> S)N (u\s,x) - d s (f>(u\s,x))) = r~ 1 e A r(n) 

with 



1 N 

\k{u) := — ^2 [exp(m T X(i n + 6 n )) - <f>(u\s,x) - d s cp(u\s, x)(s - 5 n ) ir n , k (s 



n=l 

and 



hN \ h N J \ h N 



We have |7r ni fe| < n^h^ and 



E 



vr 2 log 2 ( 1+£ ) 7T 2 k ] < <K* 2 h^ log 2 ^ 1+£ \h N ), fc = 0, 1, e>0, 



with some positive constants 7r| and 7r|. The following lemma holds 
Lemma 7.2. Fix some (s,x) £ S xD and denote 

1 N 

Wn,i(u) := - J2* ntl (8)w(\\u\\)D n (u), 1 = 0,1, 

n=l 

where D n (u) := exp(vu T X n ) — e~ mT x 4>{u\5 n , x) . Let w be a Lipschitz continuous weighting func- 
tion satisfying 

< w(z) < min{l,log~ 1/2 ~ 5 (z)}, z G R+ 
/or arbitrary small 5 > 0. TTien /or large enough ( > 



(38) P ( sup |HV»| > U l ° g ^ N N ) < log^ d / 2 (N)N- K , N 



00 



with some k > 1, r > and Z = 0, 1. 
Proof. Consider the sequence $A_k = e^k$, $k\in\mathbb{N}$, and cover each cube $[-A_k, A_k]^d$ by $M_k =
(\lfloor(2d^{1/2}A_k)/\gamma\rfloor + 1)^d$ disjoint small cubes $\Lambda_{k,1},\ldots,\Lambda_{k,M_k}$, the edges of each cube being of
length $\gamma/d^{1/2}$. Let $u_{k,1},\ldots,u_{k,M_k}$ be the centers of these cubes. We have for any natural $K > 0$

max sup IH^o^)! < max max | WN,o(uk,m)\ 



k=l,...,K 



A k -i<\\u\\<A k 



k=l,...,K ||u fe , m ||>A fe _i 

+ max max sup \Wn o(u) — Wn o(uk m)\- 

k=l,...,Kl<m<M ku&Akm 



Hence 



(39) P max sup \W N , {u)\ > A I < V V P(\W N , (u k , m )\ > X/2)+ 
\ fc=l,...,Jf A fc _i<||u||<A' ' - - 



fe=l {|K,mll>Afc-l} 
P 



sup \W N , (v)-W N ,o(u)\ > A/2 

u— ^||<7 i 



It holds for any u,v £ R rf 

\W N ,o(v)-W N , {u)\ < 2irth^ 1 \w(\\v\\) - w(\\u\ 



(40) 



+ ^71- ^ exp(u; T X n ) - exp(iu T X 
l\hN ^— J I 

n=l 

* N 

^-^|e— T ^H5 n ,x)-e 
1 N 



n=l 



< 27T^/l 7V 1 ||u — V 



- m ' x (i>(u\5 n ,x: 

N 



N 



n=l n=l 

where $L_w$ is the Lipschitz constant of $w$. The Markov inequality implies



p ^Eoi^n - E \\x n \\] > ^ < c-*n-*e 



N 



Y^[\\x n \\-mx« 



n=l 



for any $c > 0$. Using now the Dedecker and Rio inequalities (see Dedecker et al. (2007)) and
taking into account assumptions (AX1)-(AX2), we get

p 



E 



Y^[\\x n \\-nx n \ 



n=l 



< C p {f3)N p / 2 



where $C_p(\beta)$ is some constant depending on $\beta$ and $p$ from assumptions (AX1) and (AX2)
respectively. Hence,



(41) 



P^El|A„||>2/3 1 ,^ 



< C p (P)N~ p/2 /Pl n, 



where $\beta_{1,N} = \frac{1}{N}\sum_{n=1}^{N}\mathrm{E}\|X_n\|$. Note that since $X$ is ergodic it holds

$$\beta_{1,N}\to\beta^* = \int\|x\|\,\pi(dx) < \infty, \quad N\to\infty,$$

where $\pi$ is the unique invariant distribution. Setting $\gamma = \lambda/(24\pi_0^*h_N^{-1}\max\{\beta_{1,N}, L_w\})$ and
combining (40) with the inequality (41), we obtain

$$P\left(\sup_{\|u-v\|<\gamma}|W_{N,0}(v) - W_{N,0}(u)| > \lambda/2\right) \le B_1N^{-p/2} \qquad (42)$$

with some constant $B_1$ not depending on $\lambda$ and $N$. Let us turn now to the first term on the
right-hand side of (39). If $\|u_{k,m}\| > A_{k-1}$ then it follows from Theorem 8.1 and Corollary 8.2
(see Appendix)

P(|Re[Wjv,oK m )]| > A/4) 

B-i\ 2 N \ 



< B2 exp 



4^ 2 (A fc _ 1 )log 2 ( 1+e )(/ iJv u;(^_ 1 ))7r 2 V/ lJ v + \log l (N)w(A k -.i)*t/h N 



P(|Im[Wjv,oKm)]| > A/4) 

B*\ 2 N 



< i?4 exp 



4u; 2 ( J 4 fc _ 1 )log 2 ( 1+e )(/ l7 v^(A fc _i))7r 2 V/i 7V + \ log 2 (N)w(A k ^i)ir*[/h N 



with some constants B2, -B3 and B4 depending only on the characteristics of the process X. 
Taking A = Qh~^ /2 \og^ l+e \h N )N~ 1 / 2 log 1 / 2 N with C > and using the fact that h' 1 = O(N), 
we get 

£ P(|HV,oK m )| > A/2) < {[(2d l / 2 A k )h\ + 1 

{||«fc,mll>^fc-l} 



x exp 



B*\ 2 N 



^{A^log^XhNwiAk^l/hN + Alog 2 (iVM4fe-iKM 



IV 



OO 



< exp (- 2(A ^f^— A J log^^(iV), iV 

\ u; 2 (^ fc _!) log 2 ( 1+£) (^(A-i)) / 

with r = 2(1 + e) and some constant -B > 0. Fix > such that B9 > d and compute 

£ PdWiV.oKm)! > A/2) < ^^5(^1)^/2 ^^-^^^^(fc-lJCf'log^-e) 

{||«fc,m||>^k-l} 

< e k(d-0B) j (r-l)d/2/jyN e -B(fc-l)(C 2 logiV-0)+dlog(iV)_ 

If £ 2 log N > 9 we get asymptotically 

£ £ P(|HV,oK m )| > A/2) < iog(-iW 2 (iV) e -(^^)iogW. 

fc=2{||wfe, m ||>A fc _i} 

Taking large enough $\zeta > 0$, we get (38). □

Combining Lemma 7.2 with the inequality

$$|\phi(u|\delta_n,x) - \phi(u|s,x) - \partial_s\phi(u|s,x)(s - \delta_n)| \le h_N^2\sup|\partial_{ss}\phi(u|s,x)|, \quad |s - \delta_n| \le h_N,$$

and using Lemma 2.2, we obtain for large enough $\zeta > 0$

< log- d/2 (iV)A^ K , iV^oo, A: = 0,1. 



P sup [ti;(||u||)|e fc («)|] >C 



'loe 1+r iV 



Nh 



+ h N 



N 



Finally, inequalities (j35|) . ([36]) and ([37|) together with the Borel-Cantelli lemma entail (j2"Tj) and 



8 Appendix 

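For any two $\sigma$-algebras $\mathcal{A}$ and $\mathcal{B}$, define the $\alpha$-mixing coefficient by

$$\alpha(\mathcal{A},\mathcal{B}) = \sup_{A\in\mathcal{A},\,B\in\mathcal{B}}|P(A\cap B) - P(A)P(B)|.$$

Let $(Z_k, k\ge 1)$ be a sequence of real random variables defined on $(\Omega, \mathcal{F}, P)$. This sequence is
called strongly mixing if

$$\alpha(n) = \sup_{k\ge 1}\alpha(\mathcal{M}_k, \mathcal{G}_{k+n})\to 0, \quad n\to\infty,$$

where $\mathcal{M}_j = \sigma(Z_i, i\le j)$ and $\mathcal{G}_j = \sigma(Z_i, i\ge j)$ for $j\ge 1$. The following theorem can be found
in Merlevède, Peligrad and Rio.

Theorem 8.1. Let $(Z_k, k\ge 1)$ be a strongly mixing sequence of centered real valued random
variables on the probability space $(\Omega, \mathcal{F}, P)$ with the mixing coefficients satisfying

$$\alpha(n) \le a\exp(-cn), \quad n\ge 1, \quad a > 0, \quad c > 0. \qquad (43)$$

Assume that $\sup_{k\ge 1}|Z_k| \le M$ a.s., then there is a positive constant $C$ depending on $c$ and $a$
such that

$$P\left(\left|\sum_{i=1}^{N}Z_i\right| \ge \zeta\right) \le \exp\left(-\frac{C\zeta^2}{Nv^2 + M^2 + M\zeta\log^2(N)}\right)$$

for all $\zeta > 0$ and $N\ge 4$, where

$$v^2 := \sup_i\Big(\mathrm{E}[Z_i^2] + 2\sum_{j>i}|\mathrm{Cov}(Z_i, Z_j)|\Big).$$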



Corollary 8.2. Denote

$$\rho_j = \mathrm{E}\big[Z_j^2\log^{2(1+\varepsilon)}(|Z_j|^2)\big], \quad j = 1, 2, \ldots,$$

with some $\varepsilon > 0$ and suppose that all $\rho_j$ are finite. Then

$$\sum_{j>i}|\mathrm{Cov}(Z_i, Z_j)| \le C\max_j\rho_j$$

for some constant $C > 0$ provided that (43) holds. Consequently the following inequality holds

$$v^2 \le \sup_i\mathrm{E}[Z_i^2] + C\max_i\rho_i.$$

Proof. Due to the Rio inequality

$$|\mathrm{Cov}(Z_i, Z_j)| \le 2\int_0^{\alpha(|j-i|)}Q_{Z_i}(u)Q_{Z_j}(u)\,du,$$

where for any random variable $X$ we denote by $Q_X$ the quantile function of $|X|$. Define



Px = E 



X 2 log 2 ( 1+£ ) (\X\ 2 ) 



for some e > 0. Markov inequality implies for small enough u > 



P |X| > 



1/2 



U 1 /2|log( ? x)|(l+^) 



< E 



X 2 log 2 ( 1+£ )(|X| 2 )) 



Px 



u- l \og^ 1+£ \u) 



PX 



II 



loK -2(l+ £ ) _ 

g \u\og 2 ^( U ) J 

log" 2 ( 1+£ ) (p x log- 2 ( 1+e )(n)) <u 



and therefore 



Hence 



Qx(u) < - 



1/2 

Px 



U 1 /2|log(«)|(l+^) 



ra(|j-*l) 107-07 

\Cov(Z l ,Z j )\<2 I : Jll du < 2^p^log^- 2£ («(|j - i\)) 



o u 



l og 2(l+e) ( , 



and 



^ Cov(^, Zj-) < C^pz-Yl \— ]\l+2e 
j>i j>i U 1 



with some constant C > depending on a. 



□ 